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medial-axis-based,  which  requires  a  perfect  bounding  contour.  Diffusion  processes  are  central  to  neurobiology,  and  we  have  discovered  how 
to  use  them  to  bridge  the  gap  between  (local)  edges  and 
(global)  descriptions  for  matching.  We  have  proved  that  the 

equilibria  of  these  distributions  signals  infonnation  from  the  distance  map  that  underlies  medial  axis  computations.  This  equilibrium 
distribution  therefore  makes  explicit  global  pattern  from  local  features  and  can  be  used  for  matching.  We  have  shown  how  this  provides  a 
novel  solution  to  detecting  airports  and  other  "complex  features"  in  imagery,  and  how  it  suggests  a  novel  solution  to  the  border-ownership 
problem  in  neuroscience. 
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Background 


There  are  a  range  of  target  recognition  problems,  starting  with 
extremely  simple  ones  such  as  bar  code  recognition  and  extending 
through  more  difficult  problems  such  as  face  detection  and 
recognition.  Finally,  there  are  those  problems  of  interest  to  the 
military,  security,  medical  and  surveillance  communities  that  are  less 
well  defined.  They  involve  notions  of  detecting  a  "threat"  or  a 
change  in  an  "airport"  but  without  proper  computational  definitions 
of  them. 

Approaches  to  all  of  these  problems  begin  with  the  idea  that  local  features  can  be  aggregated  into  a  kind  of 

hierarchy,  with  smaller,  more  local  features  giving  rise  to  slightly  less  local  ones,  and  so  on  until  a  classifier  stage  is  reached. 

Significant  steps  along  these  lines  include  Low  [1],  Viola  [2]  and  LeCunn  [3]. 

Poggio  and  colleagues  [4]  have  applied  a  similar  methodology  to  object 
recognition,  but  with  a  biological  motivation.  Building  on  the  idea 
that  the  visual  parts  of  the  brain  consist  of  a  "hierarchy"  of 
different  areas,  and  assuming  that  each  area  builds  a  description  that 
is  more  abstract  and  global  than  the  previous  one,  a  staged  linear 
combination  is  combined  with  the  MAX  operation  to  emulate  the 
feedforward  pathways  for  object  recognition.  However  this  system  is 
effectively  the  same  in  its  capability  to  those  above. 

Problem  Statement 

We  address  the  target  recognition  problem  by  focusing  on  intermediate-level  vision.  Early 

biological  vision  extracts  edges  and  contours  of  various  lengths,  but 

these  essentially  describe  local  information.  High-level  recognition 

is  either  template-based,  which  is  fragile  with  respect  to  lighting, 

size,  or  clutter;  or  medial-axis-based,  which  requires  a  perfect 

bounding  contour.  Intermediate-level  vision  must  bridge  this  gap. 

The  heart  of  our  approach  can  be  described  in  two  ways.  First, 
mathematically,  we  have  discovered  a  theorem  about  how  aspects  of  the 
distance  map,  a  central  component  to  shape  descriptions,  can  be  "read 
out"  of  the  equilibrium  of  a  diffusion  process  deriving  from  edge 
arrangements.  From  a  neurobiological  perspective,  it  incorporates 
feedforward  and  feedback  projections,  both  of  which  are  fundamental, 
and  does  so  in  a  novel  fashion. 

More  generally,  diffusion  processes  are  central  to  neurobiology,  and  we  have 
discovered  how  to  use  them  to  bridge  the  gap  between  (local)  edges  and 
(global)  descriptions  for  matching.  Suppose  edges  could 
produce  an  abstract  "substance"  which  diffuses  through  surrounding 
substrate  until  equilibrium.  We  have  proved  that  the 

equilibria  of  these  distributions  approximate  the  distance  map  that  underlies  medial  axis 
computations.  This  equilibrium  distribution  therefore  makes  explicit  global 
pattern  from  local  features  and  can  be  used  for  matching.  Our  goal 
is  to  exploit  this  for  recognition  by  defining  features  on  this  new  map. 

If  successful,  new,  improved  and  extremely  efficent  ATR  algorithms 
should  arise  from  these  biologically-based  techniques. 

Summary  of  the  Most  Important  Results 

1 .  Development  of  a  system  of  reaction-diffusion  differential 
equations  to  show  how  area  effects  can  be  integrated  into  "action  at 
a  distance"  phenomena. 


2.  Applied  this  system  to  a  problem  in  biology:  what  is  the  signal 
that  ground  cells  send  to  vein  cells  to  initiate  new  vein  formation? 

3.  Applied  this  system  to  a  problem  in  computer  vision:  detection  of 
airports  from  high-altitude  imagery.  To  our  knowledge  this  system  is 
the  first  to  formulate  a  formal  approach  to  such  "complex  feature 
detection"  problems. 
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Appendix 

During  the  year  we  have  extended  in  detail  our  a  model  for  diffusion; 
we  have  written  several  papers  about  it  that  relate  to  both  biology 
(predicting  the  manner  in  which  veins  develop  in 

plants)  and  computer  vision  (definition  of  complex  features  such  as  airports). 

Such  configurations  are  difficult  to  define  recognition 
procedures  for  because,  while 

airports  have  runways  and  buildings,  they  need  not  be  in  precisely  the  same 
or  even  approximate  relationships.  Thus  existing  computer  vision  techniques  cannot 
be  applied.  We  are  continuing  to  develop 

the  analogy  with  plants  and  we  are  exploring  the  distance-map  information 
and  what  this  implies  for  complex  features.  Our  major  new  work  in  the 
past  year  was  the 

extraction  of  basic  information  from  these  "distance  images"  using  non-linear  dimensionality 

reduction  techniques.  The  surprising  result  is  that  a  manifold  emerges 

with  the  principle  eigenfunctions  spanning  density  and  edge 

orientation,  somewhat  analagously  to  the  (intensity,  hue,  saturation) 

representation  of  color.  Current  work  is  defining  classifiers  for 

airports  on  this  manifold. 
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